Abstract



Approximating a general formula from above and below by Horn formulas (its Horn 
envelope and Horn core, respectively) was proposed by Selman and Kautz (1991, 1996) as a 
form of "knowledge compilation," supporting rapid approximate reasoning; on the negative 
side, this scheme is static in that it supports no updates, and has certain complexity 
drawbacks pointed out by Kavvadias, Papadimitriou and Sideri (1993). On the other 
hand, the many frameworks and schemes proposed in the literature for theory update and 
revision are plagued by serious complexity-theoretic impediments, even in the Horn case, 
as was pointed out by Eiter and Gottlob (1992), and is further demonstrated in the present 
paper. More fundamentally, these schemes are not inductive, in that they may lose in a 
single update any positive properties of the represented sets of formulas (small size, Horn 
structure, etc.). In this paper we propose a new scheme, incremental recompilation, which 
combines Horn approximation and model-based updates; this scheme is inductive and very 
efficient, free of the problems facing its constituents. A set of formulas is represented by 
an upper and lower Horn approximation. To update, we replace the upper Horn formula 
by the Horn envelope of its minimum-change update, and similarly the lower one by the 
Horn core of its update; the key fact which enables this scheme is that Horn envelopes and 
cores are easy to compute when the underlying formula is the result of a minimum-change
update of a Horn formula by a clause. We conjecture that efficient algorithms are possible 
for more complex updates. 

1. Introduction 

Starting with the ideas of Levesque (1986), in recent years there has been increasing interest
in computational models for rapid approximate reasoning, based on a "vivid" (that is to say, 
conducive to efficient deductions) representation of knowledge. One important proposal in 
this regard has been the knowledge compilation idea of Selman and Kautz (1991, 1996), 
whereby a propositional formula is represented by its optimal upper (relaxed) and lower 
(strict) approximations by Horn formulas — the corresponding Horn formulas are called in 
the present paper the Horn envelope and the Horn core of the original formula. The key 
idea of course is that, since these approximate theories are Horn, one can use them for rapid
(linear-time) approximate reasoning. 

Despite the computational advantages and attractiveness of this idea, some obstacles 
to its implementation have been pointed out. First, as was noted by Selman and Kautz 
(1991, 1996), the Horn approximations are hard to compute in general, and may in fact be 
exponentially large when compared to the formula being approximated. Second, although 
the Horn envelope of a formula is unique up to equivalence, the Horn core is not; that is, 
there may be exponentially many inequivalent most relaxed Horn formulas implying the 
given one. As was proved by Kavvadias, Papadimitriou and Sideri (1993), selecting the 
one with the largest set of models, or one that is approximately optimal in this respect 
(within any bounded ratio), is NP-hard. Another disadvantage is that the Horn envelope 
may have to be exponentially larger, as a Boolean formula, than the given formula. What 
is more alarming is that, even if the Horn envelope is small, it may take exponential time to 
produce. Even if we are given the set of models of the original formula, there is no known 
output-polynomial algorithm for producing all clauses of the Horn envelope. (An algorithm 
is output-polynomial if it runs in time that is polynomial in both the size of its input 
and its output; this novel and little-studied concept of tractability — and, unfortunately, 
related concepts of intractability — have proved very relevant to various aspects of AI.) In 
fact, it was shown by Kavvadias, Papadimitriou and Sideri (1993) that generating the Horn 
envelope from the models of a formula is what we call in the present paper TRANSVERSAL-hard,
suggesting that it is problematic whether it has an output-polynomial algorithm.
These negative complexity results for knowledge compilation (admittedly, quite mild when 
compared with the serious obstacles to other approaches to knowledge representation and 
common-sense reasoning, e.g., Eiter & Gottlob 1992; 1993) are summarized without proof 
in Theorem 1. 

Our knowledge about the world changes dynamically — and the world itself changes as 
well. The knowledge compilation proposal contains no provisions for incorporating such 
belief revisions or updates. There are, of course, in the literature many formalisms for 
updating and revising knowledge bases and databases with incomplete information (Dalal
1988; Satoh 1988; Borgida 1985; Weber 1985; Ginsberg 1986; Eiter & Gottlob 1992; Fagin,
Ullman & Vardi 1983; Winslett 1988; Winslett 1990; Forbus 1989). As was established
by Eiter and Gottlob (1992), all these systems are plagued with tremendous complexity 
obstacles — even making the next inference, which is known as the counterfactual problem, is 
complete at some high level of the polynomial hierarchy for all of them. We point out in this 
paper (Theorem 2) some serious problems associated with computing the updated/revised 
formula in the two formula-based frameworks (Ginsberg 1986; Fagin, Ullman & Vardi 1983;
Winslett 1988) even if the formula being updated is Horn. The only ray of hope from Eiter 
and Gottlob (1992) — namely that when the formula is Horn, the update/revision is small, 
and the approach is any one of the model-based ones, then counterfactuals are easy — is 
tarnished by the observation that, in all these cases, the updated/revised formula is not 
Horn (this is part (iii) of Theorem 2); hence, such an update/revision scheme would fail 
to be inductive, that is, does not retain its positive computational properties in the face of 
change. 

To summarize, knowledge compilation of arbitrary formulas is not easy to do. And all
known approaches to the update/revision problem encounter serious complexity obstacles,
or result in loss of the Horn property. What hope is there then for a system that supports
both rapid approximate reasoning and updates/revisions?

Quite surprisingly, combining these two ideas, both shackled as they are by complexity- 
theoretic obstacles, seems to remove the obstacles from both, thus solving the combined 
problem, at least in some interesting cases which were heretofore believed to be intractable. 
In particular we propose the following scheme: Suppose that formula $\Gamma$ is represented by its
Horn envelope $\overline{\Gamma}$ and its Horn core $\underline{\Gamma}$ (to start the process, we incur a one-time computational
cost for computing these bounds; alternatively, we may insist that we start with a Horn
formula, in which case initially $\overline{\Gamma} = \underline{\Gamma} = \Gamma$). Suppose now that we update/revise our formula
by $\phi$, a "simple enough" formula (how "simple" it has to be for our scheme to be efficient
is an important issue which we have only partially explored; we know how to handle a
single Horn clause, as well as several other special cases). We represent the updated formula
by the two formulas $\overline{\overline{\Gamma} + \phi}$ and $\underline{\underline{\Gamma} + \phi}$, where '+' stands for an appropriate model-based
update/revision formalism. That is, our updated/revised upper and lower bounds are the
Horn envelope of the updated upper bound and the Horn core of the updated lower bound.
These are our new $\overline{\Gamma}$ and $\underline{\Gamma}$. In other words, we update/revise the two approximations,
and approximate the two results, each in the safe direction. And so on, starting from 
the new approximations. The key technical point which makes this scheme work is that, 
although updating/revising Horn formulas, even by Horn clauses, does not preserve the 
Horn property, and finding Horn envelopes and cores is hard in general, it is easy when the 
formula to be approximated is the result of the update/revision of a Horn formula by a Horn 
clause. To our knowledge, our proposal, with all its restrictions, is the first computationally 
feasible approach to knowledge approximation and updates. 

As the following example suggests, our proposal exhibits a desirable and intuitively
expected "minimum-change" behavior, best demonstrated in the case in which a Horn
formula $\Gamma$ is updated by a Horn clause, say $\phi = (x \wedge y \to z)$. Suppose that $\Gamma$ can be written
as $x \wedge y \wedge \neg z \wedge \Gamma'$, where $\Gamma'$ does not involve $x$, $y$, or $z$ (if this is not possible, that is to say,
if $\Gamma$ does not contradict $\phi$, then $\Gamma + \phi = \Gamma \wedge \phi$). Then the upper and lower approximations
are these: $\overline{\Gamma + \phi}$ is $(x \wedge y \equiv z) \wedge \Gamma'$, while $\underline{\Gamma + \phi}$ is $x \wedge (y \equiv z) \wedge \Gamma'$ (or $y \wedge (x \equiv z) \wedge \Gamma'$; recall
that cores are not unique). Notice the attractive "circumscriptive" nature of the updates
(resulting from the minimum-change update and revision formalisms that we are using).

We conclude this introduction with a few disclaimers. As should be expected, and 
is pointed out in this paper, the computational feasibility of our approach comes with a
"semantic price": the upper and lower bounds $\overline{\Gamma}$ and $\underline{\Gamma}$ do not correspond in any natural
way to some formula $\Gamma$; in fact, depending on the update formalism adopted, $\underline{\Gamma}$ may even fail
to imply, as might be expected, $\overline{\Gamma}$ (see Theorem 5 and the example that follows). The pair
$(\overline{\Gamma}, \underline{\Gamma})$ should be most accurately understood not as an efficient approximation of knowledge
revision and updates, or an efficient dynamization of knowledge compilation, but instead as 
a new, combined, efficient approach to both the problems of vivid and dynamic knowledge. 
Its effectiveness as a knowledge representation formalism (that is, its semantic proximity to 
the situations being modeled, updated, and approximated, especially after a large number of 
updates) can only be tested experimentally, by applying it to typical or classical knowledge 
representation problems. Its apparent advantages are that (1) it comes with efficiency 
guarantees, and (2) it addresses — but of course does not provably solve — both dynamic 
and approximation aspects of the knowledge representation problem. Also, despite the fact 
that our approach produces the next representation in linear time, there is no guarantee
that the bounds will not become exponentially larger than the formulas handled or than 
necessary (for example, after repeated doublings of the size of the representation). Finally, 
although we do argue that our approach is surrounded by negative complexity results in 
almost all directions, we are of course not claiming that it is the only computationally 
feasible approach that is possible. 

2. Negative Results 

Let $\Gamma$ be a propositional formula. Define (Selman and Kautz, 1991, 1996) its Horn envelope
$\overline{\Gamma}$ to be a Horn formula such that (a) $\Gamma \models \overline{\Gamma}$, and (b) there is no other Horn formula $\Gamma' \neq \overline{\Gamma}$
such that $\Gamma \models \Gamma' \models \overline{\Gamma}$; that is, $\overline{\Gamma}$ is the strongest Horn formula implied by $\Gamma$; it is called the
least Horn upper bound by Selman and Kautz (1991). Symmetrically, the Horn core $\underline{\Gamma}$ is defined to be
a weakest Horn formula implying $\Gamma$ (it is called the greatest Horn lower bound by Selman
and Kautz, 1991). Naturally, one could not hope that the Horn envelope and core can be
efficiently computed for all Boolean formulas. The reason is simple: $\Gamma$ is unsatisfiable iff
both $\overline{\Gamma}$ and $\underline{\Gamma}$ are unsatisfiable, and it is well known that Horn formulae can be checked
for satisfiability in linear time. But what if $\Gamma$ is given in some more convenient form, say in
terms of its set of models $\mu(\Gamma)$ (that is, in "full disjunctive form")? A first problem is that
$\overline{\Gamma}$ may have exponentially many clauses with respect to the size of $\mu(\Gamma)$; there is little that
can be done about this, since we need them all to best approximate our formula. But can we hope
to output these clauses, however many they may be, in time polynomial both in the size of
the input, $\mu(\Gamma)$, and of the output, $\overline{\Gamma}$? There are systematic ways that output all clauses
of $\overline{\Gamma}$, but unfortunately in all known algorithms there may be exponential delay between
the production of two consecutive clauses. There is no known output-polynomial algorithm 
for this problem. 

There are many instances of such enumeration problems in the literature, for which no 
output-polynomial algorithm is known (despite the fact that, in contrast to NP-complete 
problems, it is trivial to output the first solution). The most famous one is to compute 
all transversals of a hypergraph (Eiter & Gottlob, 1995) (see the Appendix for a definition 
and discussion of this problem). As was pointed out by Eiter and Gottlob (1995), many 
enumeration problems arising in AI, databases, distributed computation, and other areas 
of Computer Science, turn out to be what we call in this paper TRANSVERSAL-hard, in 
the sense that, if they are solvable in output polynomial time, then the transversal problem 
is likewise solvable. It should be noted that recent research paints a rosier picture for the 
TRANSVERSAL problem, by showing that it can be done in output-subexponential time 
(Fredman & Khachiyan, 1996); but still, no output-polynomial algorithm is known. 

Theorem 1: Enumerating all clauses of the Horn envelope of a given set M of models is 
TRANSVERSAL-hard. As for the Horn core, selecting the Horn core (among the possibly 
exponentially many incomparable ones) with the maximum number of models (i.e., the one 
that best approximates M) is NP-complete; furthermore even approximating the maximum 
within any constant ratio is NP-complete. □ 

Proofs of these results are given by Kavvadias, Papadimitriou and Sideri (1993); a 
version of the second result, in a different model and cost criterion, was shown independently 
by Cadoli (1993). 

The computational problems related to updates and belief revisions are in fact much
harder. Let $\Gamma$ be a set of Boolean formulas, and let $\phi$ be another formula; $\phi$ will usually
be assumed to be of size bounded by a small constant $k$. We want to compute a new set of
formulas $\Gamma + \phi$: intuitively, the result of updating or revising our knowledge base $\Gamma$ by the
new information $\phi$. There are many formalisms in the literature for updating and revising
knowledge bases. First, if $\Gamma \wedge \phi$ is satisfiable, then all approaches (with the single exception of
Winslett, 1988) define $\Gamma + \phi$ to be precisely $\Gamma \wedge \phi$ (we often blur the distinction between
a set of formulas and their conjunction). So, suppose that $\Gamma \wedge \phi$ is unsatisfiable.

1. In the approach introduced by Fagin, Ullman, and Vardi (1983), and later elaborated
on by Ginsberg (1986), we take $\Gamma + \phi$ to be not a single set of formulas, but the set
of all maximal subsets of $\Gamma$ that are consistent with $\phi$, with $\phi$ added to each.

2. We shall consider here a more computationally meaningful variant called WIDTIO
(for "when-in-doubt-throw-it-out"), in which $\Gamma + \phi$ is the intersection of the maximal
sets mentioned in (1).

3. The above approaches are syntactic, in that they define the updated formulas explicitly.
The remaining approaches are semantic, and they define $\Gamma + \phi$ implicitly by its
set of models $\mu(\Gamma + \phi)$, given in terms of the set of models of $\Gamma$, $\mu(\Gamma)$, and that of
$\phi$, $\mu(\phi)$; notice that, if $\Gamma \wedge \phi$ is unsatisfiable, these two sets are disjoint. All five
approaches take $\mu(\Gamma + \phi)$ to be the projection of $\mu(\Gamma)$ on $\mu(\phi)$, the subset of $\mu(\phi)$ that
is closest to $\mu(\Gamma)$, and they differ in their notions of a "projection" and "closeness."
In Satoh's (1988) and Dalal's (1988) models, the projection is the subset of $\mu(\phi)$ that
achieves minimal distance from any model in $\mu(\Gamma)$ (in Dalal's it is minimum Hamming
distance, in Satoh's minimal set-theoretic difference). In Borgida's (1985) and
Forbus's (1989) models, the projection is the subset of $\mu(\phi)$ that achieves minimal
distance from some model in $\mu(\Gamma)$ (in Forbus's it is minimum Hamming distance, in
Borgida's minimal set-theoretic difference). Finally, Winslett's (1988) approach is a
variant of Borgida's, in which the "projection" is preferred over the intersection even
if $\Gamma \wedge \phi$ is satisfiable.

Eiter and Gottlob (1992) embark on a systematic study of the complexity issues involved 
in the various formalisms for updates and revisions. They show that telling whether $\Gamma + \phi \models \psi$
in any of these approaches (this is known as the counterfactual problem) is complete for
levels in the polynomial hierarchy beyond NP; that is to say, hopelessly complex, even
harder than NP-complete problems. When $\Gamma$ and $\phi$ are Horn, and $\phi$ is of bounded size,
Eiter and Gottlob (1992) show their only positive result (for adverse complexity results, 
even in extremely simple cases, in approaches 1 and 2, see Theorem 2 parts (i) and (ii) 
below): The problem is polynomial in the approaches 3-7. This seems at first sight very 
promising, since we are interested in updating Horn approximations by bounded formulas. 
The problem is that the updated formulas cease being Horn (part (iii)). 

We summarize the negative results original to this paper as follows (see the Appendix 
for the proofs; point (iii) is an easy observation which we include for completeness): 

Theorem 2: Computing $\Gamma + \phi$, where $\Gamma$ is a set of Horn formulas and $\phi$ is a Horn formula:

(i) Is TRANSVERSAL-hard in the Fagin-Ullman-Vardi-Ginsberg (1983; 1986) approach.

(ii) Is $FP^{NP[\log n]}$-complete in the WIDTIO approach (that is, as hard as any problem
that requires for its solution the interactive use of an NP oracle $\log n$ times).

(iii) May result in formulas that are not Horn in the model-based approaches. □

Regarding Part (ii), a coNP lower bound and a $P^{NP[\log n]}$ upper bound were shown by
Eiter and Gottlob (1992). Liberatore (1995) shows that, unless the polynomial hierarchy
collapses, Horn updates result in formulas with inherently exponential length.

3. Incremental Recompilation 

We now describe our scheme for representing propositional knowledge in a manner that 
supports rapid approximate reasoning and minimum-change updates/revisions. At time $i$
we represent our knowledge base with two Horn formulas $\overline{\Gamma_i}$ and $\underline{\Gamma_i}$. We start the process
by computing the Horn envelope and core of the initial formula $\Gamma_0$, incurring a start-up
computational cost; alternatively, we may insist that we always start with a Horn formula.
Notice that we are slightly abusing notation, in that $\overline{\Gamma_i}$ and $\underline{\Gamma_i}$ may not necessarily be the
Horn envelope and core of some formula $\Gamma_i$; they are simply convenient upper (weak) and
lower (strict) bounds of the knowledge base being represented.

When the formula is updated by the formula $\phi_i$, the new upper and lower bounds are
as follows:

$$\overline{\Gamma_{i+1}} := \overline{\overline{\Gamma_i} + \phi_i},$$
$$\underline{\Gamma_{i+1}} := \underline{\underline{\Gamma_i} + \phi_i}.$$

Here '+' denotes any one of the update formalisms discussed (the effect of the update
formalism on our scheme is discussed in Section 4). That is, the new upper bound is the 
Horn envelope of the updated upper bound, and the new lower bound is the Horn core of 
the updated lower bound. Obviously, implementing this knowledge representation proposal 
relies on computing the Horn envelopes and cores of updated Horn formulas. We therefore 
now turn to this computational problem. 

To understand the basic idea, suppose that we want to update a Horn formula $\Gamma$ by a
clause $\phi = (\neg x \vee \neg y)$. Let us consider the interesting case in which $\Gamma \wedge \phi$ is unsatisfiable, and
therefore $\Gamma$ can be written as $\Gamma = x \wedge y \wedge \Gamma'$ for some Horn formula $\Gamma'$ not involving $x$ and $y$.
Consider now any model $m$ of $\Gamma$; it is of the form $m = 11m'$, where 11 is the truth values
of $x$ and $y$, and $m'$ is the remaining part of the model. The models of $\phi$ that are closest
to it (both in minimum Hamming distance and in minimal set difference, as dictated by all
five approaches) are the two models $01m'$ and $10m'$. Taking the union over all models of
$\Gamma$, as the formalisms by Borgida and Forbus suggest, we conclude that $\Gamma + \phi$, the updated
formula, is $(x \oplus y) \wedge \Gamma'$. It is easy to see that the Horn envelope of this formula is just
$(\neg x \vee \neg y) \wedge \Gamma'$, while the Horn core is either $x \wedge \neg y \wedge \Gamma'$ or $y \wedge \neg x \wedge \Gamma'$; we can choose either
one of the two.
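
The two-variable case above can be checked by brute force. The following is a minimal sketch, not the linear-time procedure of the paper: it enumerates models explicitly, uses the Forbus-style rule (union, over the models of $\Gamma$, of the Hamming-closest models of $\phi$), and relies on the closure property stated in Section 4.2, namely that a set of models is Horn exactly when it is closed under component-wise AND. The function names and the choice $\Gamma' = w$ are hypothetical and serve only as an illustration.

```python
from itertools import product

def hamming(a, b):
    """Number of positions where two truth assignments differ."""
    return sum(p != q for p, q in zip(a, b))

def forbus_update(gamma_models, phi_models):
    """Union, over each model m of Gamma, of the models of phi closest to m."""
    result = set()
    for m in gamma_models:
        best = min(hamming(m, t) for t in phi_models)
        result |= {t for t in phi_models if hamming(m, t) == best}
    return result

def and_closure(models):
    """Smallest superset of `models` closed under component-wise AND."""
    closed = set(models)
    frontier = list(closed)
    while frontier:
        a = frontier.pop()
        for b in list(closed):
            c = tuple(p & q for p, q in zip(a, b))
            if c not in closed:
                closed.add(c)
                frontier.append(c)
    return closed

# Variables are ordered (x, y, w); Gamma' = w is a hypothetical stand-in.
ALL = set(product((0, 1), repeat=3))
gamma = {m for m in ALL if m[0] and m[1] and m[2]}   # x & y & w
phi = {m for m in ALL if not (m[0] and m[1])}        # (not x) or (not y)

updated = forbus_update(gamma, phi)    # models of (x xor y) & w
envelope = and_closure(updated)        # models of the Horn envelope
```

Under these assumptions, `updated` contains the two models 01 and 10 (each extended by $w = 1$), and `envelope` adds the model 00. Each of the two singletons of `updated` is closed under AND on its own, which matches the two possible cores.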

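As we mentioned in the introduction, if the update is a Horn implication, such as
$\phi = (x \wedge y \to z)$ with $\Gamma$ of the form $x \wedge y \wedge \neg z \wedge \Gamma'$, the upper and lower approximations are
these: $\overline{\Gamma + \phi}$ is $(x \wedge y \equiv z) \wedge \Gamma'$, while $\underline{\Gamma + \phi}$ is $x \wedge (y \equiv z) \wedge \Gamma'$ or $y \wedge (x \equiv z) \wedge \Gamma'$. The
generalization to arbitrary Horn formulas is obvious.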

Theorem 3: The Horn envelope and core of the update of a Horn formula $\Gamma$ by $\phi$, such
that $\phi$ is a single Horn clause and $\Gamma \wedge \phi$ is unsatisfiable, in any one of the five model-based
update formalisms 3-7 above, can be computed in linear time.

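Proof: First suppose $\phi = (\neg x_1 \vee \neg x_2 \vee \ldots \vee \neg x_k \vee x_{k+1})$. Then, we can express $\Gamma$ as
$x_1 \wedge x_2 \wedge \ldots \wedge x_k \wedge \neg x_{k+1} \wedge \Gamma'$, where $\Gamma'$ depends only on $x_{k+2}, \ldots, x_n$. $\Gamma'$ can be obtained in
linear time by simply substituting the values $x_1 = 1, x_2 = 1, \ldots, x_k = 1, x_{k+1} = 0$ into $\Gamma$.
Any model $m$ of $\Gamma$ is of the form $11 \ldots 10m'$, where $m'$ is a model of $\Gamma'$. The closest models
of $\phi$ to $m$ (both in Hamming distance and minimality of set difference) are these (where all
omitted bits are 1s):

$$011 \ldots 110m', \quad 101 \ldots 110m', \quad \ldots, \quad 111 \ldots 100m', \quad 111 \ldots 111m'.$$

However, these are the models of the formula $\Gamma + \phi = \Phi \wedge \Gamma'$ where

$$\Phi = (\neg x_1 x_2 \ldots x_k \neg x_{k+1} \vee x_1 \neg x_2 \ldots x_k \neg x_{k+1} \vee \ldots \vee x_1 x_2 \ldots \neg x_k \neg x_{k+1} \vee x_1 x_2 \ldots x_k x_{k+1}).$$

Hence, in all revision/update formalisms, $\Gamma + \phi = \Gamma' \wedge \Phi$. Since $\Gamma'$ is a Horn formula, we
have that $\overline{\Gamma + \phi} = \overline{\Phi} \wedge \Gamma'$ and $\underline{\Gamma + \phi} = \underline{\Phi} \wedge \Gamma'$; we must therefore compute the envelope
and core of $\Phi$. It is not difficult to see that the possible cores of $\Phi$ are the formulas
$x_1 x_2 \ldots x_{i-1} x_{i+1} \ldots x_k (x_i \equiv x_{k+1})$ for $i = 1, \ldots, k$, and thus

$$\underline{\Gamma + \phi} = x_1 x_2 \ldots x_{i-1} x_{i+1} \ldots x_k \wedge (x_i \equiv x_{k+1}) \wedge \Gamma'.$$

On the other hand any model of the envelope of $\Phi$ either has $x_1 = x_2 = \ldots = x_{k+1} = 1$ or
it has $x_{k+1} = 0$ and at least one of $x_1, \ldots, x_k$ equal to 0, so we can write

$$\overline{\Gamma + \phi} = (x_{k+1} \equiv x_1 x_2 \ldots x_k) \wedge \Gamma'.$$

If $\phi$ is a negative clause (i.e., there is no $x_{k+1}$) then similarly $\underline{\Gamma + \phi}$ can be

$$\neg x_1 \wedge x_2 \wedge x_3 \wedge \ldots \wedge x_k \wedge \Gamma' \quad \text{and} \quad \overline{\Gamma + \phi} = \phi \wedge \Gamma',$$

or any such formula, with another one of $\{x_1, \ldots, x_k\}$ negated. □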
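
The closed forms in the proof can be sanity-checked for small $k$ by exhaustive enumeration. The sketch below is only a verification aid under stated assumptions, not the linear-time algorithm of the theorem: it ignores $\Gamma'$, represents formulas by their model sets over $(x_1, \ldots, x_{k+1})$, and treats a model set as Horn when it is closed under component-wise AND. All function names are hypothetical.

```python
from itertools import product

def and_closure(models):
    """Smallest superset of `models` closed under component-wise AND."""
    closed = set(models)
    frontier = list(closed)
    while frontier:
        a = frontier.pop()
        for b in list(closed):
            c = tuple(p & q for p, q in zip(a, b))
            if c not in closed:
                closed.add(c)
                frontier.append(c)
    return closed

def update_models(k):
    """Models of Phi: one x_i (i <= k) set to 0 with x_{k+1} = 0, or all ones."""
    flipped = {tuple(0 if j == i else 1 for j in range(k)) + (0,) for i in range(k)}
    return flipped | {(1,) * (k + 1)}

def claimed_envelope(k):
    """Models of x_{k+1} <-> (x_1 & ... & x_k)."""
    return {m for m in product((0, 1), repeat=k + 1) if m[k] == int(all(m[:k]))}

def claimed_core(k, i):
    """Models of: x_j for every j != i (j <= k), and x_i <-> x_{k+1}; i is 0-based."""
    return {m for m in product((0, 1), repeat=k + 1)
            if all(m[j] for j in range(k) if j != i) and m[i] == m[k]}

def check(k):
    """True if the claimed envelope and all k claimed cores agree with brute force."""
    phi_models = update_models(k)
    if and_closure(phi_models) != claimed_envelope(k):
        return False
    for i in range(k):
        core = claimed_core(k, i)
        if not (core <= phi_models and and_closure(core) == core):
            return False
        # Maximality: adding any other model of Phi forces an AND outside Phi.
        if any(and_closure(core | {extra}) <= phi_models for extra in phi_models - core):
            return False
    return True
```

For example, `all(check(k) for k in range(1, 6))` exercises clause lengths up to six literals; larger values remain feasible only as long as $2^{k+1}$ assignments can be enumerated.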

4. Discussion 

Theorem 3 implies that incremental recompilation in the face of single Horn clause updates
can be carried out very efficiently in all model-theoretic formalisms, except for Winslett's.
Can this scheme be efficiently extended to the case in which $\phi$ has several Horn clauses? We
next argue that the answer is negative. In fact, suppose that $\phi$ is the conjunction of several
negative clauses, with no positive literals in them, and that $\Gamma$ is of the form $x_1 \wedge \ldots \wedge x_k \wedge \Gamma'$,
where $x_1, \ldots, x_k$ are the variables appearing in $\phi$. Consider a model $11 \ldots 1m'$ of $\Gamma$; what
is the closest in Hamming distance model of $\phi$? The answer is the model that has zeros in
those variables among $x_1, \ldots, x_k$ which correspond to a minimum hitting set of the clauses
(considered as sets of variables). Recall that a minimum hitting set of a family of sets is a
set that intersects all sets in the family and is as small as possible. Finding the minimum
hitting set of a family is a well-known NP-hard problem (Papadimitriou 1993). Therefore,
telling whether the Horn envelope of the updated formula (in the Forbus and Dalal models,
which use Hamming distance) implies $x_i$ is equivalent to asking whether $i$ is not involved
in any minimum-size hitting set, a coNP-complete problem! We have proved:

Theorem 4: Computing the Horn envelope of the update of a Horn formula by the conjunction
of negative clauses in the Forbus or Dalal formalisms is NP-hard. □
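
The correspondence between closest models and minimum hitting sets used in the argument above can be illustrated on a small instance. This is a brute-force sketch with hypothetical names; it ignores $\Gamma'$, indexes variables from 0, represents each negative clause as the set of indices of its variables, and assumes that no clause is empty.

```python
from itertools import combinations, product

def closest_models_to_all_ones(k, clauses):
    """Models of the negative clauses that are Hamming-nearest to 11...1."""
    sat = [m for m in product((0, 1), repeat=k)
           if all(any(m[v] == 0 for v in c) for c in clauses)]
    best = min(m.count(0) for m in sat)   # distance to all-ones = number of zeros
    return {m for m in sat if m.count(0) == best}

def minimum_hitting_sets(k, clauses):
    """All hitting sets of minimum size, found by trying sizes in increasing order."""
    for size in range(k + 1):
        hits = [set(s) for s in combinations(range(k), size)
                if all(set(s) & c for c in clauses)]
        if hits:
            return hits
    return []

def implied_variables(k, clauses):
    """Variables equal to 1 in every closest model, hence implied by the envelope."""
    models = closest_models_to_all_ones(k, clauses)
    return {i for i in range(k) if all(m[i] == 1 for m in models)}

# Hypothetical instance: (not x0 or not x1) & (not x0 or not x2) & (not x3 or not x4)
example_clauses = [{0, 1}, {0, 2}, {3, 4}]
example_models = closest_models_to_all_ones(5, example_clauses)
```

In this instance the minimum hitting sets are $\{0, 3\}$ and $\{0, 4\}$, so variables 1 and 2 belong to no minimum hitting set and are the ones implied by the envelope.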

Notice, however, that this hardness result requires that the update $\phi$ involve an unbounded
number of variables. We conjecture that the Horn envelope and core of a Horn
formula updated by any formula involving a fixed number of variables can be computed in 
polynomial time in all five model-based update formalisms — although the polynomial may 
of course depend on the number of variables. In our view, this is an important and chal- 
lenging technical problem suggested by this work. We know the conjecture is true in several 
special cases — for example, the one whose unbounded variant was shown NP-complete in 
Theorem 4 — and we have some partial results and ideas that might work for the general 
case. 

4.1 The Choice of an Update Formalism 

Of the five model-based update formalisms, which one should we adopt as the update vehicle 
in our representation scheme? Besides computational efficiency (with respect to which 
there are very minor variations), there is another important desideratum: The property
that $\underline{\Gamma_i} \models \overline{\Gamma_i}$ (that is, that the "upper and lower bound" indeed imply one another in the
desirable direction) must be retained inductively.

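Definition: Let '+' be a change formalism. We say that '+' is additive if for any formulas
$A$, $B$ and $\phi$ the following holds: $(A \vee B) + \phi = (A + \phi) \vee (B + \phi)$.

Theorem 5: If $\underline{\Gamma_i} \models \overline{\Gamma_i}$, and the update formalism used is additive, then $\underline{\Gamma_{i+1}} \models \overline{\Gamma_{i+1}}$.

Proof: Let $\Delta_i$ be such that $\overline{\Gamma_i} = \underline{\Gamma_i} \vee \Delta_i$. Then, we have:

$$\overline{\Gamma_{i+1}} = \overline{\overline{\Gamma_i} + \phi_i} = \overline{(\underline{\Gamma_i} \vee \Delta_i) + \phi_i} = \overline{(\underline{\Gamma_i} + \phi_i) \vee (\Delta_i + \phi_i)}.$$

On the other hand:

$$\underline{\Gamma_{i+1}} = \underline{\underline{\Gamma_i} + \phi_i} \models \underline{\Gamma_i} + \phi_i \models (\underline{\Gamma_i} + \phi_i) \vee (\Delta_i + \phi_i) \models \overline{(\underline{\Gamma_i} + \phi_i) \vee (\Delta_i + \phi_i)}$$

and therefore $\underline{\Gamma_{i+1}} \models \overline{\Gamma_{i+1}}$. □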

Winslett's formalism satisfies the additivity condition. This is because, by definition,
the set of models of $\Gamma + \phi$ under this formalism is the union over all models $m$ of $\Gamma$ of some
function of $m$ (namely, the set of models of $\phi$ that are closest to $m$); hence, disjunction
(that is, union of models) distributes over '+'. Unfortunately, Winslett's formalism is the
only model-based formalism whose efficient implementation in the case of single Horn clause
updates is left open by Theorem 3. As the following example demonstrates, the remaining
four model-based formalisms, by treating exceptionally the case in which $\Gamma$ and $\phi$ are
consistent, are not additive, and may lead to situations in which the lower bound may fail
to imply the upper bound:

Example: Suppose that we start with this (non-Horn) formula:
$\Gamma_0 = (x \vee y) \wedge (x \vee z) \wedge (y \vee \neg z) \wedge (\neg y \vee z)$.
Then the Horn core and envelope may be $\underline{\Gamma_0} = y \wedge z$ and $\overline{\Gamma_0} = (y \vee \neg z) \wedge (\neg y \vee z)$.
If we next update by $\phi = \neg x \wedge \neg y$ and apply any of the four update/revision
formalisms other than Winslett's, we get $\underline{\Gamma_1} = \neg x \wedge \neg y \wedge z$ and $\overline{\Gamma_1} = \neg x \wedge \neg y \wedge \neg z$. This
establishes that using our technique in any one of these four formalisms may result in an
"upper bound" that fails to be implied by the corresponding "lower bound." □

The possibility of a lower bound that does not imply the upper bound is, of course, a
major weakness of our scheme. Overcoming it is a very interesting open problem. The most
satisfying (and, in our view, likely) way of overcoming it is by developing a polynomial-time 
algorithm for updating Horn formulas by clauses in Winslett's formalism. 
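
The example of this subsection can be replayed by enumerating models over $(x, y, z)$. The sketch below is an illustration under stated assumptions: it uses a Dalal-style rule (keep the common models when the two formulas are consistent, otherwise keep the models of $\phi$ at globally minimum Hamming distance), which is one of the four formalisms with the consistency exception, and it takes a set of models to be Horn when it is closed under component-wise AND. All names are hypothetical.

```python
from itertools import product

def hamming(a, b):
    return sum(p != q for p, q in zip(a, b))

def and_closure(models):
    """Smallest superset of `models` closed under component-wise AND."""
    closed = set(models)
    frontier = list(closed)
    while frontier:
        a = frontier.pop()
        for b in list(closed):
            c = tuple(p & q for p, q in zip(a, b))
            if c not in closed:
                closed.add(c)
                frontier.append(c)
    return closed

def revise(gamma_models, phi_models):
    """Common models if any; otherwise phi-models at globally minimum Hamming distance."""
    common = gamma_models & phi_models
    if common:
        return common
    best = min(hamming(m, t) for m in gamma_models for t in phi_models)
    return {t for t in phi_models
            if any(hamming(m, t) == best for m in gamma_models)}

ALL = set(product((0, 1), repeat=3))   # assignments to (x, y, z)
gamma0 = {m for m in ALL
          if (m[0] or m[1]) and (m[0] or m[2])
          and (m[1] or not m[2]) and (not m[1] or m[2])}
phi = {m for m in ALL if not m[0] and not m[1]}   # (not x) & (not y)

upper0 = and_closure(gamma0)                     # envelope: y <-> z
lower0 = {m for m in gamma0 if m[1] and m[2]}    # the core y & z
upper1 = and_closure(revise(upper0, phi))        # new upper bound
lower1 = revise(lower0, phi)                     # new lower bound (a single model)
```

Here `upper1` consists of the single model 000 and `lower1` of the single model 001, so the new lower bound is not contained in the new upper bound, as the example states.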

4.2 Characteristic Model Approximation 

Kautz, Kearns, and Selman (1993, 1995) introduced an interesting alternative way of
representing Horn formulae, namely, characteristic models. Let $\Gamma$ be a Horn formula, and let
$H$ be its set of models. It is easy to see that $H = H^*$, where $H^*$ is the smallest set that
contains $H$ and is closed under component-wise multiplication (AND) of its models; that
is, $h_1, h_2 \in H^*$ implies $h_1 \text{ AND } h_2 \in H^*$. This raises the possibility of the following
alternative representation of $H$: We represent it by a minimal set of models $C$ such that
$C^* = H (= H^*)$. This was first proposed by Kautz, Kearns, and Selman (1993); they called
this set $C$ the set of characteristic models of $H$, and they showed that it is exactly the set
of all elements of $H$ that cannot be represented as the AND of other elements of $H$. There are
Horn sets that can be represented much more succinctly by characteristic models than by 
formulae, but there are also examples showing the opposite. One definite advantage of the 
characteristic models representation is that it allows for polynomial-time abduction (Kautz, 
Kearns, and Selman 1993, 1995). 
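
The closure $H^*$ and the characteristic models described above can be computed directly for small explicit model sets. The following is a minimal sketch with hypothetical function names; it assumes models are tuples of 0/1 values and uses the observation that an element is the AND of other elements exactly when it equals the AND of all elements strictly above it in the component-wise order.

```python
def and_closure(models):
    """H*: the smallest superset of `models` closed under component-wise AND."""
    closed = set(models)
    frontier = list(closed)
    while frontier:
        a = frontier.pop()
        for b in list(closed):
            c = tuple(p & q for p, q in zip(a, b))
            if c not in closed:
                closed.add(c)
                frontier.append(c)
    return closed

def characteristic_models(horn_models):
    """Elements of an AND-closed set H that are not the AND of other elements of H."""
    H = set(horn_models)
    chars = set()
    for h in H:
        # Elements strictly above h in the component-wise order.
        above = [g for g in H if g != h and all(p <= q for p, q in zip(h, g))]
        # h is generated by others iff the AND of everything above it equals h.
        meet = tuple(min(col) for col in zip(*above)) if above else None
        if meet != h:
            chars.add(h)
    return chars

# Hypothetical example: three generators over three variables.
generators = {(0, 1, 1), (1, 0, 1), (1, 1, 0)}
H = and_closure(generators)
C = characteristic_models(H)
```

In this example `H` has seven elements while `C` recovers the three generators, and closing `C` under AND gives back `H`.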

Our next result points out a disadvantage of the characteristic models over Horn formu- 
lae. This result also frustrates immediately the possibility that updates and revisions can 
be done efficiently through the characteristic models of the Horn core and envelope (for the 
proof see the Appendix): 

Theorem 6: Unless P=NP, the set of characteristic models of the intersection of two Horn
sets of models, $H_1 \cap H_2$, cannot be computed in polynomial time, given the characteristic
models of $H_1$ and those of $H_2$. □

Finally, it turns out that there is a similar approach to Horn approximation based on 
2SAT, that is, formulas with at most two literals per clause. Although such approximations 
are plagued with similar complexity impediments as their Horn counterparts, they also 
enjoy similar updatability properties as those we showed for Horn clauses in Theorem 3 — 
except that the complexity depends quadratically on the number of literals in the update; 
see Gogic (1996). 

4.3 Open Problems 

We presented in this paper a proposal for the problems of approximate reasoning and
revisions/updates. Although we have seen that each of the constituent problems is largely
intractable, our work provides a computationally feasible and otherwise plausible way of 
approaching the combined problem. Although our positive complexity results are confined 
to single-clause updates, as far as we know, this is the first computationally feasible such 
proposal. 

The main technical open problem raised by our work is to find polynomial-time algo-
rithms for computing the Horn envelope and core of any Horn formula when updated/revised 
(in any of the five formalisms, most interestingly in Winslett's) by any bounded formula. 
We conjecture that such algorithms exist. 

Our approach responds to updates and revisions by producing approximations of the 
knowledge base which become, with new updates, more and more loose. Naturally, its 
practical applicability rests with the quality of these approximations, and their usefulness 
in reasoning. This important aspect of our proposal should be evaluated experimentally. 
A complementary way of evaluating the effectiveness of our scheme would be to apply it 
to well-studied situations and examples in AI in which reasoning in a dynamically updated 
world is well-known to be challenging, such as reasoning about action. 

Appendix

A hypergraph $H = (V, E)$ is a finite set of nodes $V$, together with a set of hyperedges $E$,
where each $e \in E$ is a subset of $V$ with at least two elements. Thus, a graph is a hypergraph
in which all hyperedges are of cardinality two. A transversal $t$ of a hypergraph is a minimal
hitting set of the hyperedges of $H$, that is, a set of nodes that has a nonempty intersection
with all hyperedges in $E$, and such that each proper subset is disjoint from some hyperedge.
TRANSVERSAL is the following computational problem: Given a hypergraph, produce 
the set of all of its transversals. It is not known whether this problem can be solved in 
output-polynomial time, that is, in time polynomial in both the number of hyperedges and 
transversals. An enumeration problem is called TRANSVERSAL-hard if TRANSVERSAL 
can be reduced in polynomial time to it. 
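
To make the definition concrete, the transversals of a small hypergraph can be listed by exhaustive search. The sketch below is deliberately naive (exponential in the number of nodes) and is not an output-polynomial algorithm, which, as noted above, is not known to exist; the function name and the example hypergraph are hypothetical.

```python
from itertools import combinations

def transversals(nodes, hyperedges):
    """All minimal hitting sets of `hyperedges`, by brute force over subsets of `nodes`."""
    edges = [frozenset(e) for e in hyperedges]
    found = []
    # Candidates are tried in order of increasing size, so every minimal hitting
    # set smaller than the current candidate has already been recorded.
    for size in range(len(nodes) + 1):
        for cand in combinations(sorted(nodes), size):
            s = frozenset(cand)
            hits_all = all(s & e for e in edges)
            # s is minimal iff no recorded transversal is a proper subset of it.
            if hits_all and not any(t < s for t in found):
                found.append(s)
    return found

# Hypothetical example: a path on four nodes viewed as a hypergraph.
example = transversals({1, 2, 3, 4}, [{1, 2}, {2, 3}, {3, 4}])
```

For this path the transversals are $\{1, 3\}$, $\{2, 3\}$ and $\{2, 4\}$.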

Finally, the complexity class $FP^{NP[\log n]}$ is the class of all functions that can be computed
in polynomial time when given access at most $O(\log n)$ times to an oracle that
correctly answers 3SAT questions (or questions related to any other NP-complete problem).

Theorem 2: Computing $\Gamma + \phi$, where $\Gamma$ is a set of Horn formulas and $\phi$ is a Horn formula:

(i) Is TRANSVERSAL-hard in the Fagin-Ullman-Vardi-Ginsberg (1983; 1986) approach.

(ii) Is $\mathrm{FP}^{\mathrm{NP}[\log n]}$-complete in the WIDTIO approach (that is, as hard as any problem
that requires for its solution the interactive use of an NP oracle $\log n$ times).

(iii) May result in formulas that are not Horn in any one of the model-based approaches.

Proof of Part (i): Let $H = (V, E)$ be a hypergraph, where $V = \{1, 2, \ldots, n\}$ and
$E = \{e_1, \ldots, e_m\}$. We first construct $\Gamma$ and $\phi$ in the following way: The set of variables will be
$X = \{x_1, \ldots, x_n\}$. $\Gamma = \{g_1, \ldots, g_n\}$ consists of the formulas $g_i = x_i$ for $1 \le i \le n$. Finally,
$\phi$ consists of all clauses $(\neg x_{i_1} \vee \cdots \vee \neg x_{i_k})$, where $e_j = \{i_1, \ldots, i_k\}$ is an edge in $E$.
Claim 1: If $t$ is a transversal of $H$ then $M = \{g_i : 1 \le i \le n,\ i \notin t\}$ is a maximal subset of
$\Gamma$ consistent with $\phi$.

Proof: We need to prove that $M$ is consistent with $\phi$ and that adding any other $g_i$ will
change that. Let $v = (v_1, \ldots, v_n)$ be a truth assignment to the variables in $X$ such that
$v_i = 0$ if $i \in t$ and $v_i = 1$ otherwise. From the definition we see that $v$ satisfies all formulas
in $M$. On the other hand, take any clause in $\phi$, say $C_e = (\neg x_{i_1} \vee \neg x_{i_2} \vee \cdots \vee \neg x_{i_k})$. Then,
edge $e = \{i_1, \ldots, i_k\}$ in $E$ intersects $t$, say in vertex $i_1$, which means that $v_{i_1} = 0$ and
therefore clause $C_e$ is satisfied by $v$. So, $v$ satisfies both $\phi$ and $M$ and therefore the two are
consistent.

Suppose now we add a formula $g_i$ to $M$. From the definition of $M$ we see that $i \in t$
and therefore there is an edge $e$ that does not intersect $t - \{i\}$ (because $t$ is a transversal).
But this now implies that all variables in $C_e$ appear in $M \cup \{g_i\}$ which means that $\phi$ and
$M \cup \{g_i\}$ are inconsistent. □

Claim 2: If $M = \{g_{i_1}, \ldots, g_{i_k}\}$ is a maximal subset of $\Gamma$ consistent with $\phi$ then
$t = V - \{i_1, \ldots, i_k\}$ is a transversal of $H$.

Proof: We first prove that $t$ intersects all edges in $E$. Take an edge $e$ and look at the clause
$C_e$ of $\phi$. We know that $C_e$ is consistent with $M$ which means that there is some $g_j = x_j$
that does not belong to $M$ while $\neg x_j$ is in $C_e$. Now, from the definition of $t$ and $C_e$ we see
that $j$ belongs to both $t$ and $e$.

Let us now prove that $t$ cannot be reduced to any smaller set. Suppose that $t - \{i\}$
is a transversal of $H$ where $i \notin \{i_1, \ldots, i_k\}$. Then, from Claim 1 we see that
$M' = \{g_i, g_{i_1}, \ldots, g_{i_k}\}$ is consistent with $\phi$ which contradicts the fact that $M$ is maximal.

From Claims 1 and 2 it follows that we have reduced the problem of finding the transversals
of a hypergraph to the problem of finding all maximal subsets of $\Gamma$ consistent with $\phi$,
which means that computing $\Gamma + \phi$ is TRANSVERSAL-hard in the Fagin-Ullman-Vardi-Ginsberg
approach.
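
The correspondence established by Claims 1 and 2 can be checked on small hypergraphs with the following sketch. It builds the theory of unit formulas $x_i$ and the formula whose clauses are $(\neg x_{i_1} \vee \cdots \vee \neg x_{i_k})$, one per hyperedge, finds all maximal consistent subsets by exhaustive search, and returns their complements. All function names are illustrative assumptions, and the exhaustive satisfiability test is exponential; it stands in for any consistency check and is not the procedure of the paper.

```python
from itertools import combinations, product


def consistent(true_vars, edges, n):
    """Is {x_i : i in true_vars} together with phi satisfiable?

    phi has one clause per hyperedge e, requiring some x_i with i in e
    to be false. Satisfiability is tested over all 2^n assignments.
    """
    for bits in product([0, 1], repeat=n):
        v = dict(zip(range(1, n + 1), bits))
        units_hold = all(v[i] == 1 for i in true_vars)
        phi_holds = all(any(v[i] == 0 for i in e) for e in edges)
        if units_hold and phi_holds:
            return True
    return False


def maximal_consistent_subsets(edges, n):
    """Index sets of the maximal subsets of {g_1..g_n} consistent with phi."""
    subsets = [frozenset(c)
               for k in range(n + 1)
               for c in combinations(range(1, n + 1), k)]
    ok = [s for s in subsets if consistent(s, edges, n)]
    return [s for s in ok if not any(s < t for t in ok)]


def transversals_via_update(edges, n):
    """Complements of the maximal consistent subsets (Claim 2)."""
    universe = frozenset(range(1, n + 1))
    return {universe - m for m in maximal_consistent_subsets(edges, n)}


if __name__ == "__main__":
    print(transversals_via_update([{1, 2}, {2, 3}, {3, 4}], 4))
```

On the path hypergraph with hyperedges {1,2}, {2,3}, {3,4}, the maximal consistent subsets are indexed by {2,4}, {1,4} and {1,3}, whose complements are exactly the three transversals.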

Part (ii): $\mathrm{FP}^{\mathrm{NP}[\log n]}$ is the class of problems solvable in polynomial time with a Turing
machine that can ask $O(\log n)$ questions to an NP oracle. The class is equivalent to the
class of problems solvable by a polynomial time machine that can ask a linear number of
questions to an NP oracle, but all in parallel. A problem complete for this class is: Given
$n$ Boolean formulas $F_1, \ldots, F_n$, compute a vector $v = (v_1, \ldots, v_n)$ such that $v_i = 1$ if and
only if $F_i$ is satisfiable. It is easy to see that instead of Boolean formulas we can give $n$
instances of any NP-complete problem.

To compute an update in the WIDTIO approach, it is enough to take every formula $g$
of $\Gamma$ and to ask the following question:

Q: Can we choose a subset of $\Gamma - \{g\}$, consistent with $\phi$ when viewed alone, but inconsistent
with $\phi$ when enlarged by $g$?

Since all formulas are Horn, these are $n$ questions in NP that can be asked independently
(in parallel), and therefore our problem is in $\mathrm{FP}^{\mathrm{NP}[\log n]}$.

In order to prove $\mathrm{FP}^{\mathrm{NP}[\log n]}$-hardness, we will first prove that answering (Q) above is
NP-hard.
Claim 3: Let $\Gamma = \{g, g_1, g_2, \ldots, g_n\}$ be a collection of Horn formulas and let $\phi$ be a Horn
formula. Then, telling whether $g$ belongs to all maximal subsets of $\Gamma$ consistent with $\phi$ is
coNP-complete.

Proof: The proof is by reduction from a problem we call PURE3SAT (sometimes called, in 
our view, misleadingly, MONOTONE SAT), defined next. Call a clause pure if it contains 
only positive or only negative literals. A Boolean formula in CNF is called pure if it contains 
only pure clauses. PURE3SAT is the problem of deciding whether a pure 3CNF formula is 
satisfiable; it is known to be NP-complete (e.g. Garey & Johnson, 1979). 

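Suppose $F$ is a pure 3CNF formula containing positive clauses $P_1, \ldots, P_r$ and negative
clauses $N_1, \ldots, N_s$, over the set of variables $\{x_1, \ldots, x_n\}$. We introduce variables
$X_1, X_2, \ldots, X_n, Y_1, \ldots, Y_r$ and define

$g = (\neg Y_1 \vee \neg Y_2 \vee \cdots \vee \neg Y_r)$

$g_i = X_i \wedge \bigl(\bigwedge_{j : x_i \in P_j} Y_j\bigr)$ for $1 \le i \le n$

$\phi = \bigwedge_{(\neg x_i \vee \neg x_j \vee \neg x_k) \in F} (\neg X_i \vee \neg X_j \vee \neg X_k)$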

We now prove that $F$ is satisfiable if and only if there is a counterexample for $g$, that is,
a set $\Gamma' \subseteq \{g_1, \ldots, g_n\}$ that is consistent with $\phi$ but becomes inconsistent with $\phi$
when enlarged by $g$.

Suppose that $v = (v_1, v_2, \ldots, v_n)$ is a satisfying assignment for $F$. We define
$\Gamma' = \{g_i : v_i = 1\}$, and we claim that $g$ is inconsistent with $\Gamma'$. Since each positive clause $P_j$ of $F$ is
satisfied, there must be at least one true $x_i$ in $P_j$, and therefore each $Y_j$ is a conjunct of
some formula in $\Gamma'$. But $g$ states that at least one of the $Y_j$'s must be false. On the other hand, $\Gamma'$ alone
is consistent with $\phi$, because $\phi$ essentially contains the negative clauses of $F$, and we know
that the truth assignment satisfies these. So, $\Gamma'$ is consistent with $\phi$, yet it cannot be enlarged
by $g$ and therefore $g$ does not belong to all maximal subsets.

Conversely, suppose that such a $\Gamma'$ exists. We define $v_i = 1$ if $g_i \in \Gamma'$ and $v_i = 0$ otherwise, and by the same
line of reasoning as in the previous paragraph we prove that $v = (v_1, \ldots, v_n)$ satisfies $F$,
which concludes the proof of the claim.
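
The construction in the proof of Claim 3 can be exercised on small inputs with the sketch below. It builds $g$, the formulas $g_i$ and $\phi$ from a pure 3CNF formula, represented (as an assumption of this sketch) by two lists of index triples, and compares the existence of a counterexample for $g$ with the satisfiability of the formula. Formulas are encoded as Python predicates over an assignment dictionary, positive clauses are numbered from 0 in the code, and every satisfiability test is exhaustive; all names are hypothetical.

```python
from itertools import product


def build(n, pos, neg):
    """Return (g, {i: g_i}, phi, variable names) for a pure 3CNF formula.

    pos and neg are lists of clauses; each clause is a tuple of variable
    indices in 1..n (all positive in pos, all negated in neg).
    """
    r = len(pos)

    def g(a):  # at least one Y_j is false
        return any(not a[("Y", j)] for j in range(r))

    def make_gi(i):  # X_i and every Y_j whose clause P_j contains x_i
        js = [j for j, clause in enumerate(pos) if i in clause]
        return lambda a: a[("X", i)] and all(a[("Y", j)] for j in js)

    def phi(a):  # the negative clauses, rewritten over X_1..X_n
        return all(any(not a[("X", i)] for i in clause) for clause in neg)

    gs = {i: make_gi(i) for i in range(1, n + 1)}
    names = [("X", i) for i in range(1, n + 1)] + [("Y", j) for j in range(r)]
    return g, gs, phi, names


def satisfiable(formulas, names):
    """Exhaustive satisfiability test for a list of predicates."""
    for bits in product([False, True], repeat=len(names)):
        a = dict(zip(names, bits))
        if all(f(a) for f in formulas):
            return True
    return False


def has_counterexample(n, pos, neg):
    """Is some subset of {g_1..g_n} consistent with phi, but not with phi and g?"""
    g, gs, phi, names = build(n, pos, neg)
    for mask in product([False, True], repeat=n):
        chosen = [gs[i] for i in range(1, n + 1) if mask[i - 1]]
        if (satisfiable(chosen + [phi], names)
                and not satisfiable(chosen + [g, phi], names)):
            return True
    return False


def pure_sat(n, pos, neg):
    """Exhaustive satisfiability test for the pure 3CNF formula itself."""
    for bits in product([False, True], repeat=n):
        v = dict(zip(range(1, n + 1), bits))
        if (all(any(v[i] for i in c) for c in pos)
                and all(any(not v[i] for i in c) for c in neg)):
            return True
    return False
```

For instance, the formula with clauses $(x_1 \vee x_2 \vee x_3)$ and $(\neg x_1 \vee \neg x_2 \vee \neg x_3)$ is satisfiable and admits the counterexample $\{g_1\}$, while the formula with clauses $(x_1 \vee x_1 \vee x_1)$ and $(\neg x_1 \vee \neg x_1 \vee \neg x_1)$ is unsatisfiable and admits none.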

Let us now take $n$ instances of PURE3SAT. For every instance $F_i$ we build $\Gamma^i$ and $\phi^i$
the way described in the proof of Claim 3, over a new set of variables. Our $\Gamma$ will be the union of all
$\Gamma^i$'s, and $\phi$ will be the conjunction of all $\phi^i$'s. By updating $\Gamma$ with $\phi$ using the WIDTIO
approach, we obtain a new formula that will contain $g^i$ if and only if $F_i$ is unsatisfiable.
Therefore, we can compute the answers to all $n$ instances just by looking at the update,
which shows that our problem is $\mathrm{FP}^{\mathrm{NP}[\log n]}$-hard.

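Part (iii): Let $\Gamma = x \wedge y$ and $\phi = \neg x \vee \neg y$. Notice first that $\Gamma \wedge \phi$ is not satisfiable. The
set of models for $\Gamma$ is $\{11\}$ while the set of models for $\phi$ is $\{00, 01, 10\}$. In all model-based
approaches, the set of models of $\Gamma + \phi$ is equal to $\{01, 10\}$, which cannot be
represented by a Horn formula: the set of models of a Horn formula is closed under bitwise
conjunction, and the conjunction $00$ of $01$ and $10$ is missing from it.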
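
The example of Part (iii) can be reproduced with a short sketch. As a stand-in for "all model-based approaches" it implements only one of them, a Winslett-style update that keeps, for each model of the old theory, the models of the new formula whose set of changed variables is minimal under inclusion; this choice, and all function names, are assumptions of the sketch. Horn representability is tested through closure of the model set under bitwise conjunction.

```python
from itertools import product


def models(formula, n):
    """All 0/1 tuples of length n that satisfy the Python predicate `formula`."""
    return {bits for bits in product((0, 1), repeat=n) if formula(*bits)}


def pma_update(gamma_models, phi_models):
    """For each model of Gamma, keep the phi-models with inclusion-minimal change."""
    result = set()
    for m in gamma_models:
        diff = {p: frozenset(i for i, (a, b) in enumerate(zip(m, p)) if a != b)
                for p in phi_models}
        result |= {p for p in phi_models
                   if not any(diff[q] < diff[p] for q in phi_models)}
    return result


def closed_under_and(model_set):
    """True if the bitwise AND of any two models is again in the set."""
    return all(tuple(a & b for a, b in zip(m1, m2)) in model_set
               for m1 in model_set for m2 in model_set)


if __name__ == "__main__":
    gamma = models(lambda x, y: x and y, 2)
    phi = models(lambda x, y: (not x) or (not y), 2)
    updated = pma_update(gamma, phi)
    print(sorted(updated), closed_under_and(updated))
```

The update of $\{11\}$ by $\{00, 01, 10\}$ yields $\{01, 10\}$, which fails the closure test, whereas the model set of $\phi$ itself passes it.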

This completes the proof of Theorem 2. □ 

Theorem 6: Unless P=NP, the set of characteristic models of $H_1 \cap H_2$ cannot be computed
in polynomial time, given the characteristic models of $H_1$ and those of $H_2$.

Proof: We shall establish this by proving that the following problem is NP-complete: 

MAXMODEL: Given two sets of models $M_1$ and $M_2$ that are the characteristic models
representing Horn sets $H_1$ and $H_2$ respectively, and an integer $k$, determine whether there
is a model in $H_3 = H_1 \cap H_2$ with more than $k$ ones.
If MAXMODEL is NP-complete, then obviously one cannot compute in polynomial time
the set of characteristic models of $H_1 \cap H_2$, given the characteristic models of $H_1$ and of
$H_2$, unless of course P=NP. The MAXMODEL problem is obviously in NP because we can
simply guess a model $m$, check whether it contains more than $k$ ones, and then check whether
it belongs to $H_1$ and $H_2$.

In order to prove NP-hardness of the problem, we show a reduction from NODE COVER.
We are given a graph $G = (V, E)$ with $|V| = n$ and $|E| = s$, and an integer $\ell$, and we are
asked whether there is a set of fewer than $\ell$ nodes that cover all edges. We define two sets
of characteristic models $M_1$ and $M_2$, as follows: For every edge $e_r = (i, j) \in E$ we have in
$M_1$ a model $h = (h_1, \ldots, h_{s+n})$ such that $h_r = h_{s+i} = 0$ while all other bits of $h$ are 1, as
well as a model $h' = (h'_1, \ldots, h'_{s+n})$ such that $h'_r = h'_{s+j} = 0$ while all other bits of $h'$ are
1. In $M_2$ we add for every $i$ between 1 and $n$ a vector $h = (h_1, \ldots, h_{s+n})$ such that
$h_1 = \cdots = h_s = h_{s+i} = 0$, and all other bits are 1. Our result now follows from this claim:

Claim 4: Let $k = n - \ell$. $G$ has a node cover of size less than $\ell$ if and only if $H_3 = H_1 \cap H_2$
has a model with more than $k$ ones.

Proof: Notice that $H_2$ contains all vectors having 0 at the first $s$ positions and at least
one 0 among the last $n$ positions. In order to obtain a vector in $H_1$ that also belongs to
$H_2$ (and therefore to $H_3$) we need to take vectors in $M_1$ that will produce (under bitwise
multiplication) a vector having 0 on the first $s$ positions. It is easy to see the idea behind
our construction: If vector $h$ in $H_1$ has $h_r = h_{s+i} = 0$, it represents the fact that by putting
node $i$ in the cover set edge $r$ is covered. So, if the set of vectors in $M_1$ produces under
bitwise multiplication a vector in $H_2$, we have obtained a node cover in $G$, where a 0 at
position $s + i$ means that node $i$ is in the node cover. Both directions of the claim are now
obvious.

To return to the proof of Theorem 6, it follows from Claim 4 that it is NP-hard to find a maximal
model of the intersection of two sets in the characteristic models representation. On the
other hand, every maximal model belongs to the set of characteristic models, and therefore
if we could find the intersection of two sets in the characteristic models representation we
would be able to find the maximal model, completing the proof of Theorem 6. □
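
The reduction from NODE COVER can be checked on small graphs with the following sketch. It builds $M_1$ with two vectors per edge and $M_2$ with one vector per node (that is, for every $i$ between 1 and $n$, which is what the description of $H_2$ in the proof of Claim 4 requires), generates the Horn sets as closures under bitwise multiplication, and compares the largest number of ones in the intersection with the size of a smallest node cover. Function names are illustrative assumptions, positions are 0-based in the code, and both the closure and the node cover search are exponential.

```python
from itertools import combinations


def and_closure(vectors):
    """Horn set generated by `vectors`: their closure under bitwise AND."""
    closed = set(vectors)
    frontier = set(vectors)
    while frontier:
        new = {tuple(a & b for a, b in zip(u, w))
               for u in frontier for w in closed} - closed
        closed |= new
        frontier = new
    return closed


def build_instance(n, edges):
    """Characteristic models M1, M2 for a graph with nodes 1..n."""
    s = len(edges)

    def vec(zero_positions):
        return tuple(0 if p in zero_positions else 1 for p in range(s + n))

    m1 = set()
    for r, (i, j) in enumerate(edges):
        m1.add(vec({r, s + i - 1}))  # edge r covered by node i
        m1.add(vec({r, s + j - 1}))  # edge r covered by node j
    m2 = {vec(set(range(s)) | {s + i - 1}) for i in range(1, n + 1)}
    return m1, m2


def max_ones_in_intersection(n, edges):
    """Largest number of ones of a model in H3 (graph must have an edge)."""
    m1, m2 = build_instance(n, edges)
    h3 = and_closure(m1) & and_closure(m2)
    return max(sum(h) for h in h3)


def min_node_cover(n, edges):
    """Size of a smallest node cover, by exhaustive search."""
    for size in range(n + 1):
        for c in combinations(range(1, n + 1), size):
            if all(i in c or j in c for i, j in edges):
                return size


if __name__ == "__main__":
    path = [(1, 2), (2, 3), (3, 4)]
    print(max_ones_in_intersection(4, path), min_node_cover(4, path))
```

On the path 1-2-3-4 the smallest node cover has size 2 and the best model of $H_3$ has $4 - 2 = 2$ ones, in agreement with Claim 4.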